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DIRECTIONAL POINCARE INEQUALITIES 
ALONG MIXING FLOWS 

STEFAN STEINERBERGER 


Abstract. We provide a refinement of the Poincare inequality on the torus there exists a 
set B oi directions such that for every a ^ B there is a Cq; > 0 with 

l|V/||^^(V)ll > Cc.||/|ll 2 (Td) for all / S with mean 0. 

The derivative (V/, a) does not detect any oscillation in directions orthogonal to a, however, 
for certain a the geodesic flow in direction a is sufficiently mixing to compensate for that defect. 
On the two-dimensional torus the inequality holds for a = (l,\/2) but fails for a = (l,e). 
Similar results should hold at a great level of generality on very general domains. 


1. Introduction and main result 

1.1. Introduction. The classical Poincare inequality on the torus states 

l|V/llL2(T<i) > ll/llL2(T<i) 

for functions / £ with vanishing mean. A natural interpretation is that a function with 

small derivatives cannot substantially deviate from its mean on a set of large measure. The purpose 
of this paper is to derive a substantial improvement; we first state the main result. 

Theorem 1 (Directional Poincare inequality). There exists a set B G such that for every 
a € B there is a Ca > 0 so that 

l|V/||l^(V^)|| (V/,a) > c„||/|ll.(T.) 

for all f £ with mean 0. If d> 2, then B is uncountable hut Lebesgue-null. 

The exponents are optimal. The proof is simple and based on elementary properties of Fourier 
series - we believe it to be of great interest to understand under which conditions comparable 
inequalities exist on a general Riemannian manifold (M, g) equipped with a suitable vector field. 



Figure 1. A well-mixing flow transports (dashed) every point relatively quickly 
to a neighborhood of every other point. 
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On the torus, the inequality has strong ties with number theory and can be easily derived at the 
cost of invoking highly nontrivial results (Schmidt’s result on badly approximable numbers, the 
Khintchine theorem). One remarkable feature is that the inequality holds for a set of Lebesgue 
measure 0 which shows it to be very delicate (however, as is explained below, slightly weaker 
statements seem to be very robust). A natural interpretation of the inequality seems to be the 
following: given two nearby points x,y G for which f(x) ^ f(y), the classical Poincare 
inequality will detect a large gradient between them. The term | (V/, a) | might not detect the 
large gradient but following the ergodic vector field will relatively quickly lead to a neighborhood 
of y. A priori being in a neighborhood might not imply much because there could be still local 
oscillations on the scale of the neighborhood, however, since we also invoke a power of || V/||i 2 (Td), 
this controls the measure of the set on which local oscillations have a strong effect. This heuristic 
suggests strongly that similar inequalities should hold at a much greater level of generality. We 
discuss and prove some natural variants in the last section. 

1.2. Open problems. It would be of great interest to understand to which extent such inequali¬ 
ties can be true in a more general setup. It is not even clear to us whether comparable inequalities 
hold in L^(T'^). Generally, for suitable vector fields V on suitable Riemannian manifolds (M,g) 
it seems natural to ask whether there exists an inequality of the type 

liv/ll 

Lp{M) II fyY) /llip(M) ^ c||/||^p(^) 

for some J > 0 and all / G IT^’^(M) with mean 0. The parameter 5 can be expected to be related 
to the mixing properties of the flow - it is difficult to predict what the generic behavior on a fixed 
manifold might be (say, for a smooth perturbation of the flat metric on the torus). On we can 
rephrase the Khintchine theorem m as a statement about generic behavior for the flat metric. 

Theorem (Khintchine, equivalent). For every 6 < 1/2, the set of a for which there exists a 
Cq > 0 such that 

yfGH^r) J^jix)dx = 0 ||V/||[^fT,.)||(V/,a)/||i 2 (Tr 2 ) >c„||/|L 2 (t.) 

has full measure. 

This suggests <5 < 1/2 as a natural threshold that might be achievable by other two-dimensional 
examples - however, since the inequality is extremely delicate on T^, the manifolds on which 
the inequality holds with <5=1/2 might actually be very rare. One would expect topological 
ingredients to arise on the sphere equipped with a nontrivial vector field: the hairy ball theorem 
dictates that any smooth vector field vanishes for even d and this will necessitate a change of scaling 
in the inequality since a function / could be concentrated around the point in which the vector 
field vanishes. Furtermore, while not every nonvanishing vector field on has to have a closed 
orbit (i.e. Seifert’s conjecture is false), many of them do - this puts topological restrictions on 
what directional Poincare inequalities are possible (since one could set a function to be constant 
along a periodic orbit and decaying extremely quickly away from it). However, there should be 
a variety of admissible inequalities on the flat infinite cylinder {M,g) = (K x T‘^“^,can) and this 
could be a natural starting point for future investigations. 


2. Proof of the Statements 

2.1. Outline of the argument. The proof of the classical Poincare inequality on the torus is a 
one-line argument if one expands in Fourier series and uses /(O) = / = 0 since 


l|V/||i2(T.) = (2^)^^ ^ > (2^)^ ^ lafcp = \\f\\h^^.y 

k^Q fc^O 
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The argument also highlights the underlying convexity of the quadratic form. Our proof will be a 
direct variation of that result and uses the observation that 





^ ak{k,a)e^’^-^ 
feeZ'* 


2 


L2(T<i) 


2 


L2(T‘i) 


fceZ'^ 


This Fourier multiplier is not uniformly bounded away from 0 and will even vanish for certain 
k € ii the entries of a are not linearly independent over Q. If the entries of a are linearly 
independent over Q, then the Fourier multiplier is always nonnegative but we have no quantitative 
control on its decay (see below for an example). However, if we restrict the support of the Fourier 
coefficients to be within [k £ : \k\ < i?}, then trivially 

ll(V/,a)||i.(T.)> I \{k,a)\A \\f\\h^^.y 
\|fc|<R J 

The term in the bracket clearly has great significance in the study of geometry of numbers and 
has been studied for a long time. It suffices for us to apply the results and use the additional 
IIV/||^ 2 (']f<i) expression to ensure that a fixed proportion of the mass is contained within a 
suitable ball of frequeny space on which to apply the argument. 


2.2. Number theoretical properties. We now discuss subtleties of the inequality in greater 
detail: it is merely the classical Poincare inequality for d= 1. Letting d = 2 with a = (1, 0) yields 

l|V/||L2(T2)||9a;/||L2(T2) > c||/||^ 2 (t 2 ) whlch Is obvlously false, 

because / might be constant along the cc—direction and vary along the y—direction. More gen¬ 
erally, the inequality fails for any a with entries linearly dependent over Q and the functions 
sin (fcix-b fc 22 /) for any ki,k 2 € with ((ki, k 2 ), a) = 0 serve as counterexamples. The next 
natural example is a = (v^, 1). Suppose / £ C°°(T) and 

I (V/, (72,1)) 

/ is constant along the flow of the vector field (72,1) but every orbit is dense and thus / = 0. 
This is true for any vector with entries that are linearly independent over Q, however, it is not 
enough to prove the inequality itself: it fails for (1, e) on despite linear independence. A simple 
construction for d = 2 shows that linear independence of the entries of a. is not enough: let 

a = ( 1, I ~ (1,0.110001...) 

where the arising number, Liouville’s constant, is known to be irrational. If we set 

fNix, y) = sin 1^10"^’ ^ ^ , 

then 

||/w||i2(T2) = 27 and II V/Ar||L2(T2) < 6 • lO"^' 

while 

ll(V/iv,a)l 72 (T 2 ) = 7^ for TV >3. 

Vn^AT+l ^ / 
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2.3. An explicit example. The inequalities are not only sharp with respect to exponents, they 
are actually sharp on all frequency scales. This is in stark contrast to classical Poincare-type 
inequality which tend to be sharp for one function (the ground state of the underlying physical 
system): here, we can exclude all functions having Fourier support in the set : |^| < N} for 
arbitrarily large N and still find functions for which the inequality is sharp (up to a constant). 
We explain this in greater detail for d = 2 with the admissible direction given by the golden ratio 


a = 




e B. 


Consider the sequence of functions given by 


fn{x, y) = sin (F„+ia; - F^y), 


where Fn is the n—th Fibonacci number. An explicit computation shows that 


IIV/, 


n||L2(T2) 


dxfn 


l + VE 


dyfn 


IF, 


L2(T2) 


n+1 


-k 1 


F 


n+1 


A standard identity for Fibonacci numbers gives that 


1 + V5 

2 


F^Wfnl 


2 

L2(T2) 


lim 

n—¥oc 


Fn+1 

~F^ 


1 + V5 

2 " 75’ 


which implies that 


lim IIV/„||i 2 (T 2 ) 

n—^oo 


dxfn 


1 + V5 


dyfn 


L2(T2) 


ll/nll 


-2 

L2(T2) 


5 -I- 75 _ |q:| 
10 “ 75' 


Since these functions /„ have their Fourier transform supported on 4 points in and since 
F„+i/K^( 1 + 75)/2<2, we can conclude that every dyadic annulus in Fourier space contains 
an example for which the inequality is sharp (up to a constant). Put differently, our inequality 
is close to being attained on every frequency scale. This sequence of /„ has the advantage of 
simultaneously showing that the following statement is sharp. 


Proposition. Let d = 2 and a S be any vector for which 

I|V/|U2(t 2)|1 (V/,a) |U2 (t 2) > c7|/||i2(T2) 

holds for all f € iJ^(T^) with mean 0. Then the constant satisfies 

\a\ 


Cry ^ 


75' 


Up to certain transformation, the example above is essentially the only example for which the 
inequality is tight: normalizing a = (l,/3), the example shows that the inequality is sharp for 
ft = {1 + 75)/2 and there are only countably many other /3 for which it is sharp (that can be 
explicitely given). For all other numbers the upper bound could be improved to Ca < |a|/78. 
Removing yet another countable set of exceptional directions, we could replace 78 by 7221/5 
and the process could be continued (this follows from classical results about the structure of the 
Markov spectrum, see m- 


2.4. Badly approximable systems of linear forms. We now introduce the relevant results 
from number theory. Let Li,..., : Z'^ —>■ K. be defined as 

Li(x) = anxi + . ..aidXd = (ai,x) 


Lf (x) = aaxi F . ■. auXd = , x) 
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The relevant question is whether it is possible for all £ expressions to be very close to an integer. 
Using ll-ll : R —>• [0,1/2] to denote the distance to the closest integer, the pigeonhole principle 
implies the existence of infinitely many x G with 

max(||Li(x)||,..., ||TKx)||) < (max(|a:i|,..., \xd\))~~^ ■ 

Dirichlet’s theorem cannot be improved in the sense that there actually exist badly approximable 
vectors Oi,..., such that for some c > 0 and all x £ Z*^ 

max(||Li(x)||,..||Lf(x)||) > c (max(|a;i |,..., jxdl))”" • 

The existence of such elements was first shown by Perron |15j . Khintchine m has shown that the 
dimensional Lebesgue measure of such tuples (oi,... ,ai) is 0 and Schmidt [TH] has proven 
that their Hausdorff dimension is id. 

2.5. Proof of Theorem 1. 

Proof. We will prove the statement explicitely for the following set: for any (d — 1)—dimensional 
badly approximable vector a^-i, consider the linear form L : —>• R given by 

L(x) := (od-i.x) 


and 

a = (1, ad-i). 

Recall that ll-jl : R —>■ [0,1/2] denotes the distance to the closest integer and is trivially 1-periodic. 
Let now k G with k yi 0. If fc vanishes on all but the first component, then 

1 {a,k) 1 = [fell > 1. 

If k does not vanish on all but the first component, then 


1 {a, k) 1 = Ifci + L((fc2,..., kd))\ > \\L{{k ^,..., kdM > 


max(lfc2|, ■ • ■, [fc]'^-!’ 

where c > 0 is some constant coming from the fact that a^-i is badly approximable. Let now 

and note that oq = 0 because / has mean value 0. We have 

II (V/,a) ||i.(T.) = (2^)^^ ^ \a,\^\{k,a) [^ > 0^(2^)" ^ 

fceZ'i ' ' 




It is easy to see that 


E 


|afc| < 


llL2(T<i) 


because the opposite inequality would imply that 

iiv/iii2(T.) = E ifci'i«'=i' ^ E 


\k\'^\ak\ 




L 2 (T‘^) 


> 4 


l|V/f 


L2(Td)2 


L2(T<i)2 IIV/II 2fT<ll2 

|fc|>2 llfll ^ 

iiriq2(Trf)2 


E l«'^l' ^ 2llV/lli2(T.), 
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which is absurd. Altogether, we now have 


(V/,a) ||i2(T<i) > Y. nSA ^ II 


Ofc 


\k\2d-2 -- - y—> ^ |fc|2d-2 

feeZ'* fcez'i 


> 




ll/ll £,2(i-ti) 
| 2 c /-2 

lL2(T‘i) 




E 

ll/ll jr^ 2 (T<i) 


|afe| 


^ C^{2t:Y ll/llL2(T‘i) ll/llL2(T‘i) 

- 22^-2 WVfWY,-^,^ 2 


Rearranging gives the result. 


□ 


We remark that a classical insight of Liouville allows to give a completely self-contained proof in 
the most elementary case. If we pick a = (v^, 1), then the only information required to make the 
above argument work is the existence of a c > 0 such that 

I (fc, a) I = |fci-\/2-I-fel > E for all fc G Z^. 

|fc| 

However, this follows at once with c = 1/3 from 

1 < \2kl — A:|| = \(\f2ki — k2){\/2ki + k2)\ < 3|fc||(-\/2A;i -|- ^2)1- 

A similar argument works for any (l,a) with a algebraic over Q (Liouville’s theorem). More 
generally, a classical characterization of badly approximable numbers in one dimension as those 
numbers with a bounded continued fraction expansion implies that our proof works for 

a = (l,/3) G 

if [3 has a bounded continued fraction expansion. A theorem of Lagrange (see e.g. m) implies 
that this is always the case if /3 is a quadratic irrational. Moreover, this characterization is sharp 
on T^: if ,5 has an unbounded continued fraction expansion, then the inequality is not true for 
(l,/3) and the sequence 

Ux) = 

with kn =(numerator, -denominator) of rational approximations of /3 coming from the continued 
fraction expansion will serve as a counterexample. A very interesting special case is Euler’s 
continued fraction formula for e (see, e.g. |10]L which implies that e has an unbounded continued 
fraction expansion and that the inequality with a = (1, e) fails on T^. 


2.6. Proof of the Proposition. 

Proof. The direction a has to have both entries different from 0. We use a classical result of 
Hurwitz [7] which guarantees the existence of infinitely many k € 1? with 

^ < 1 
02 ^2 ~ \/5fc| 

For any such (fci, ^ 2 ), this can be rewritten as 

|aifc2 - 02 ^: 1 ! < . 

v5|A:2| 

We now consider f{x) = . Simple computation yields 

I|V/|U.(t 2)|| (V/,a) |U.(t2) < 
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However, as |fc| —>■ oo, we have that ki/k 2 —>■ aija^ and thus 


\o‘2\\k\ 

\k2\ 



lazi 



+ 1 = |a|. 


□ 


The constant in the result of Hurwitz is sharp for the golden ratio a = ai/a 2 = (1 + •\/5)/2 = </>. 
Moreover, it is known (see e.g. [2]) that for every 

a £ K \ Q which is not of the form a,b,c,d G Z lad — bc\ = 1, 

c(p + d 

the constant y/E could be replaced by Our example showing the sharpness of the Proposition 
using Fibonacci numbers was therefore, in some sense, best possible. 

2.7. Fractional derivatives. As is obvious from the proof, fine properties of the derivative did 

not play a prominent role, indeed, the proof really only requires an understanding of how fast the 
induced Fourier multiplier grows. This allows for various immediate generalizations. We introduce 
pseudodifferential operators P{D) on via 

P{D) ■■= Y. akP{k)P^-^ 

the same proof can immediately be applied as long as \P{k)\ —>■ oo if |A:| —>■ oo. One example on 
would be that for a £ S and all s > 0 

11^ /IIl2(T2)|| ^ ||2,2(t 2) > Ca||/||^2(x2) 

which is again sharp by the same reasoning as above. The following variant was proposed by 
Raphy Coifman: if we define 

Y, := ^ Ukklkr^P'^-^, 

fceZ'i 

then one can always observe substantial fluctuations in the (d — 1)—th derivative along the flow 

||(-D‘^/,a)||i2(Tr.i) > Ca||/||i2(Td). 

2.8. Several ergodic directions. We can also derive a statement for more than one ergodic 
direction. The same heuristics as above still apply: the main difference is that incorporating a 
control in more than one ergodic direction poses additional restrictions and requires less global 
control in the sense that a proportionately smaller power of || V/|| is necessary. 

Theorem 2. Let 1 < £ < d — 1. Then there exists a set Be £ such that for every 

(oi, 02 ,..., ae) £ Be there is a Ca > 0 with 


^ with £ associated linear forms 
L, : Z'^-i —>■ ffi. via Li = such that they form a system of badly approximable linear forms 

and set 

= ( 1 ) Pi) 


iiv/rAi-, (i: II (V/, Oi) ||L 2 (Td) J 

for all f £ with mean 0. 

Proof. We consider .^ < d — 1 vectors Pi, p 2 , ■ ■ ■, Pi from 


ae = {l,Pe) 

The same reasoning as before (distinguishing between k vanishing outside of the first component 
or not) implies again for every single 1 < i < £ 

I {ai,k) I = \ki +Li{{k2,...,kd))\ > \\Li{{k2,... ,kd))\\ 






from which we derive that whenever k is not concentrated on the first component 


I {ai,k) I > max(||Li(fc)||,..., \\Li{k)\\) > c (maxdfcal,..., \kd\)) 


If k is concentrated on the first component, we get a bound of i, which is much larger. For 


f=Y 

fc^O 


a simple computation shows that 


i /i 

Y II 1®'=!^ (1 1^ 

i=l feGZ'* \i=l 


> E l®fep (max(|fci|,. 


feGZ"^ 







The rest of the argument proceeds as before; finally, we recall that any two norms in finite¬ 
dimensional vector spaces are equivalent and thus, up to some absolute constants depending only 
on £, 

i \ ^ ^ 

EII E II iil^iT-^) 

2^1 / \i^l 

and the result follows. □ 


e 



2.9. The Hausdorff dimension. As is obvious from the proof, we are not so much interest in 
the distance to the lattice but care more about the distance to the origin. It seems that there is 
ongoing research in that direction laiHiE], which is concerned with establishing bounds on the 
dimension of the set 

max(|Li(x)|,..., |T^(x)|) > c(max(|xi|,..., , 

where | • | is the absolute value on R. For any such system of linear forms given by ai,... ,ai 
satisfying that inequality, we can improve Theorem 2 with the same proof to 


iiv/r-' 


L2(T<i) 


E II llL2(T<i)^ 


> Call/ll 


d 

L^{T^) 


\i=l / 

for all / G with mean 0. We also remark the following simple proposition (the essence of 

which is contained in M)- 


Proposition. Let d > 2. a G is admissible in Theorem 1 if and only if there exists A G R 
such that 

a = (A, A/3) 

and P is badly approximahle linear form in 


Using the result of Schmidt [18], we see that the Hausdorff dimension of the set of badly approx- 
imable vectors in R'^"^ is d — 1 and the construction increases the dimension by 1. The Hausdorff 
dimension of the set of admissible vectors in Theorem 1 is therefore d. The argument is indirectly 
contained in the earlier proofs. 
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2.10. Variants. This subsection is concerned with inequalities of the type 

liv/ll 

L2(T2) II (^/) Q^) /lli2(T2) > c||/||^2(i'2) 

for some 0 < <5 < 1/2. The case <5 = 1/2 was discussed above and following the same arguments 
immediately imply that <5 > 1/2 is impossible. However, the threshold 5 = 1/2 is also sharp. 

Theorem (Khintchine). For every 6 < 1/2, the set of a for which there exists a Ca > 0 such 
that 

liv/ll 

L2(T2) II (^/) Q^) /lli2(T2) > c||/||^2(t2) 

holds for all f G L^(T^) with mean 0 has full measure. 


Another celebrated result in Diophantine approximation is the Thue-Siegel-Roth theorem stating 
that for every irrational algebraic number a and every e > 0 we have 


V 

a - 
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> 


q2+e 


for some Cq, > 0. This immediately implies, along the same lines as above, that for every vector 
of the form a = (l,/3) with {3 being an irrational algebraic number and every e > 0 

l|V./||^/2(^2")|| (V/,a) /ll^/"-^) > c„,,||/||^2(t2). 

Recall that the inequality does probably not hold for a = (l,7r) because tt is probably not badly 
approximable. However, there are weaker positive results. A result of Salikhov m implies the 
existence of a c > 0 such that 


Repeating again the same argument as above, we can use this to derive 

\\^f\\li%.)\\ (V/,(l,^)) ||^/(t2) > c||/|L2(t2). 

Similarly, a result of Marcovecchio m shows 

iiv/iii'2/(;®2)ii (V/, (i,iog2)) > c||/il2(t2) 

and similar results are available for other numbers (i.e. tt^, log 3, ({3),.. 


3. Uniform Poincare inequalities via diffusion 

There exists a closely related problem, where we can try to substitute a directional derivative by a 
random process (that points roughly in the direction of its mean but may also behave differently). 
Let T = R \ Z be the one-dimensional torus T, let / G LP(T) and consider the averaging operator 
At : LP(T) LP(T) given by 

iAtf)ix) = fix + y)dy. 

An iterative application of At corresponds to a time-discrete diffusion. These objects have strong 
and natural ties to Poincare inequalities (see, for example, the recent book by Bakry, Gentil and 
Ledoux my One natural question is whether ||/ — Atf\\i^p(j^ can be small if / is not constant. 
Motivated by a problem of Schechtman, this was recently investigated by Nayar & Tkocz El- 
They studied the problem in the general case of convolution with a T—valued random variable Y 
which for some £ G N 

-I- U 2 H- FYi has a nontrivial absolutely continuous part. 

Theorem 3 (Nayar & Tkocz). There exists a constant c > 0 depending only on Y sueh that for 
all f G LP{T), 1 < p < 00 , with / = 0 

\\f{x)-Ef{x + tY)\\Lp^T^ > cty\f\\Lp(^jy 
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We believe the inequality to be of quite some interest; the example above corresponds to V being 
uniformly distributed on T but the setting is clearly more general than that. It is easy to see that 
the inequality is sharp: for a fixed C^(T)—function / and Y being again the uniform distribution 
it follows from a Taylor expansion that 


lim 

t —^0 


Wf-Atfh. 


IwniL.. 


Using C'^(T) ^ LP{T), the statement immediately implies a Poincare inequality 


I1/”||lp(T) > c||/||iP(T) 


with a constant c > 0 that does not depend on p. There is a natural variant: note that for any 
f G the case EU ^ 0 yields a first order contribution 


t —^0 t 


{EY)f'ix). 


This suggests that the cases E(U) = 0 and E(U) ^ 0 behave differently: from the point of view of 
diffusion, the case ET ^ 0 turns the averaging operator Ef{x + tY) into an operator with drift, 
which allows for a faster exploration of the torus. We prove that this is indeed the case. 


Theorem 4. Assume EU ^ 0. Then there exists a constant c > 0 depending only on Y such that 
for all f G LP(T), 1 < p < oo with f = 0 

\\f{x) - Ef{x + tr)llLP(T) > Ct||/||LP(T). 

It is easy to see that the statement cannot be improved: consider Y ^ W([0,1/2]) to be the uniform 
distribution on [0,1/2] and let / = X[o,i/ 2 ] ~ X[i/ 2 ,i]- Then ||/||lp(t) = 1 and for all 1 < p < oo 

\\f{x)-Ef{x + tY)U,^r)-^t. 

By taking the limit t —>■ 0, we obtain 

||/'||lp(T) > c||/||iP(T) 

with a constant c > 0 that does not depend on p. We will first prove the statements and then 
comment on the relation with the material presented above. 

3.1. The proof of Nayar & Tkocz. We start by giving a summary of the proof given in m 
since our argument for Theorem 1 is based on that approach. We write the averaging operator 

f{x) —>■ Ef{x + tY) as a convolution f ^ f * p. 

If 11/ “ / * mIIlp(t) is small, then f * p ^ f and repeating the argument suggests that 
\\f * p — f * p* pIIlp(t) should be small as well. 

Since p comes from a probability measure. Young’s convolution inequalities implies that 

\\f * p - f * P* p\\lp{T) = \\{f - f * P) * P\\lp{T) <\\f - f * Ai||LP(T)- 
The next ingredient is the triangle inequality allowing us to compare 


11/ - / * mIIlp(T) > ^ (11/ - / * mIIlp(T) + \\f*p-f* p'^\\lp{T) H-H 11/ * /i” ^ - f* m"IIlp(t)) 

> -II/-/*m"IIlp(t), 

n ^ ^ 


where 


k 

p := p* p* ■■■ * p 

'• -V-' 

k times 

is the fc—fold convolution of p with itself. The final ingredient of the argument is the following: if n 
is sufficiently large, then the n—fold convolution p." is very well-behaved: its absolutely continuous 
part satisfies a uniform lower bound. However, if p" > c in a pointwise sense and / has mean 0, 
then [Ml Lemma 2] 


11 / * P\\lp{T) < (1 - c)||/||lp(t) 
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and therefore 

11/ - / * MIIlp(T) > -\\f - f * M”I|lp(T) > - (I1/I|lp(T) - 11/ * M"I|lp(T)) > -||/||lp(T)- 



Figure 2. Flow of an expanding heat kernel along a geodesic. 


It remains to find the right value of n for which /r" > c. It is easy to see that this generally scales 
like n ^ 1/t^ for sufficiently nice /i and it remains to show that /i" is sufficiently nice for n large 
enough assuming that has an absolutely continuous part for some £. Summarizing, the heat 
kernel flattens and will eventually cover the entire torus; this requires roughly I/t^ convolutions 
which follows from the usual scaling that the variance of scales as \/k. It is not surprising that 
a drift should allow for this exploration of the torus to happen at a different time scale. 

3.2. Random variables with drift: proof of Theorem 4. 

Proof. The argument of Nayar & Tkocz used the contraction property to establish 

11/ - / * m'‘IIlp(T) < ^11/ - / * mIIlp(T)- 

We will work with an averaged version of that statement: summing from k = I,..., n, we get 

1 " 

- X! 11-^ “ ^ * /IUp(t) < n\\f - f* Ai|Up(T)- 

' k=l 

Another application of the triangle inequality implies 

1 "■ 

^ - - f * yWLPiT) < n\\f - f* Ai||Lp(T)- 

LP(T) k=l 

The conclusion of the argument proceeds in the same way (though, obviously by a different 
method): if we can conclude that 

1 " c 

-^y>c pointwise, then ||/- / * mIIlp(t) >-||/IIlp(T‘')- 

k=l 

It remains to show that the statement is asymptotically true for n ^ to conclude the result. 
We show that a result from renewal theory can now be applied relatively easily. Define Fr to 
be Y on [—1/2,1/2] C K (with corresponding measure /xr) and assume furthermore w.l.o.g. that 
EYr > 0. We first note that if some convolution 

= Yi+Y2 + --- + Yi 

on the torus has an absolutely continuous component, then so will Y^ on K: this follows immedi¬ 
ately from the Radon-Nikodym theorem and the fact that the singular measure is supported on 
a set of measure 0 (moreover, Y^ is supported on [—f/2,f/2]). This allows us to apply a result of 
C. Stone [IH]: suppose /xr 

(1) has a finite m—th moment 

(2) /Xg has a nontrivial continuous component for some £ G N 

(3) and if furthermore E/xr > 0 
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then the measure be written as 

OO 

+ ^2, 


where ui is absolutely continuous and has a density p{x) satisfying 


p{x) = 


o(\x\ "") 


l-m^ 

^-l 


as cc —>■ —OO 

(E1r)“^ + o(|a;|“’”) as a; —>■ OO 

and 1^2 is a finite measure such that all integer moments exist. 



Figure 3. Heat kernels with drift. 


The resnlt even says slightly more and allows to conclude exponential decay of U 2 as well as 
p(x) — 1/(EYr) from exponential decay of /iR (which is something that is the case in our setting 
since /xr is compactly supported but we will not use this fact). We now localize that statement 
(so as to avoid summing up to infinity). Note that has a large proportion of its mass centered 
in an interval of length ^ '/k centered around A:(E/xr). This easily implies that, for n sufficiently 
large, i bas an absolutely continuous part with a density p{x) satisfying 

11 Tl 

p{x)>-wrr for all xo < a; <-EFr, 

Z JEIr z 

where xq is a universal number depending only on Y (essentially the threshold beyond which the 
Stone asymptotic becomes effective). We remark that much sharper asymptotics could be derived 
and one could prove the same result for 

xo <x <{n — C'-\/n)EFR for some fixed C > 0 sufficiently large 


but this is not required. Applying this to tlR is very simple because the entire setup scales nicely 
under dilations. If we define pt as the probability measure of tlR. Then pt bas an absolutely 
continuous component Pt(x) satisfying 


/ N 111 

P.W > 27 


for all txo < X < — ^EIr. 

Let now n = 8/(tElR) + 2xo/{tEY^). We can conclude that 

on an interval of length at least 2 on R. 


2 , 111 
Pt(x) > --- 


2 t EYr 

Since n ~ 1/t, this now implies that 


1 


-E 


Mo 


has an absolutely continuous component with density > c 


fc=i 


for some c > 0 (that could be described in terms of quantities above) on an interval of length at 
least 2 on R. By projecting down on the torus, we have that 


1 


/x^ has an absolutely continuous component with density > c 


fe=i 


and this implies the desired result by the reasoning described above. 


□ 
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3.3. Concluding Remarks. Let us try to understand the arguments above in a more general 
context; the natural framework seems to be that of a compact Lie group and random variables 
for which the central limit theorem is valid. We are interested in the largest possible function 
h : K.+ —>■ K.+ such that there exists a constant c > 0 depending only on Y with the property that 
for all / G LP, I <p <oo satisfying J f = 0 

\\f{x)-Efix + tY)\\L.>hit)\\fU.. 

The argument of Nayar & Tkocz implies that if ET = 0 G the best answer is given by h{t) ~ 
and this follows from the fact that the characteristic time for the heat kernel to effectively cover 
the entire torus is of scale If EF ^ 0, then the situation becomes more interesting. 



Figure 4. Location of the mass of the hrst few convolutions /i^ 


Using standard approximations, we may assume that {pt again being the probability measure 
of tY) has most of its mass around tWY in a radius of scale ~ '/t. This may be best understood as 
a moving and expanding bubble. The problem is to understand the number of bubbles N required 
to have 


1 

N 




> c. 


On simple volumetric considerations imply that N > . However, it is also not difficult to 

see that N < is false, even for badly approximable EF: this would imply h{t) > t, which in 
turn would imply a Poincare type inequality 


ll(V/,a)IL.>||/||L., 

which we know to fail even for badly approximable directions. Another interesting example is the 
sphere, where one cannot improve on h{t) > because every geodesic flow is periodic. It seems 
interesting to understand how the precise quantitative behavior in t depend on approximation 
properties of EF and, more generally, how such problems behave in different geometries. 


Acknowledgement. I am grateful to Raphy Coifman for various discussions and to him, Yves 
Meyer and Jacques Peyriere for their encouragement. 
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